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Abstract 

We consider the following combinatorial problem. We are given three strings s, t, 
and t' of length L over some fixed finite alphabet and an integer m that is polylog- 
arithmic in L. We have a symmetric relation on substrings of constant length that 
specifies which substrings are allowed to be replaced with each other. Let A(n) denote 
the difference between the numbers of possibilities to obtain t from s and t' from s 
after n 6 N replacements. The problem is to determine the sign of A(m). 

As promises we have a gap condition and a growth condition. The former states 
that |A(m)| > ec m where e is inverse poly logarithmic in L and c > is a constant. 
The latter is given by A(n) < c" for all n. 

We show that this problem is PromiseBQP-complete, i.e., it represents the class of 
problems which can be solved efficiently on a quantum computer. 

1 Introduction 

Important computer-scientific challenges for quantum information science are to dis- 
cover efficient quantum algorithms for interesting problems and to understand the 
fundamental capabilities and limitations of quantum computation in comparison to 
those of classical computation. To approach these challenges it is important to bet- 
ter understand the complexity class BQP, the class of language recognition problems 
which can be decided efficiently on a quantum computer. For technical reasons it is 
necessary to consider a larger class, called PromiseBQP, as BQP is not known to con- 
tain complete problems. The difference between BQP and PromiseBQP is that in the 
latter only inputs from a certain subset of strings, the promise, have to be correctly 
classified as yes or no instances, whereas in BQP the promise is always the set of all 
strings (see [Tj [2] for more details on this difference) . 

Any computational model that is universal for quantum computing immediately 
leads to a complete problem for PromiseBQP; namely the problem to simulate that 
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model and to determine the output. The set of allowed input strings is then given by 
those for which the probability to obtain the answer YES can be guaranteed to be ei- 
ther greater than some fixed value or smaller than another value. So one can interpret 
the results proving that models such as adiabatic, topological, or one-way quantum 
computing are universal to be proving that the corresponding simulation problems 
are PromiseBQP-complete. However, such purely "quantum" problems do not really 
help us to understand the difference between quantum and classical computation. Be- 
sides we already know that a universal quantum computer could be used to simulate 
quantum systems. For this reason, an important challenge for complexity theory is to 
construct problems that seem as classical but still characterize the power of quantum 
computation. 

We now give a short overview of PromiseBQP-complete problems^. Ref. [3] char- 
acterizes the class PromiseBQP by the problem to determine the sign of so-called 
quadratically signed weight enumerators. These enumerators are polynomials whose 
coefficients are defined via some specific quadratic form with matrices and vectors con- 
taining the entries 0, 1. Another PromiseBQP-complete problem is formulated in the 
context of knot-theory. This problem consists in approximately evaluating the Jones 
polynomial of the plat closure of braids at certain roots of unity [4, 5j. This con- 
nection between quantum computing and knot theory is not unexpected since knot 
theory was successfully applied to topological quantum field theories and quantum 
computers were shown to be able to efficiently simulate topological field theories and 
vice versa [61 [7] . Extending the ideas of [5] it was shown in [8] that the problem to 
approximately evaluate the Tutte polynomial at certain points of the Tutte plane is 
also PromiseBQP-complete. 

In [2] we presented the following simple PromiseBQP-complete matrix problem: 
given a real symmetric sparse matrix A determine the sign of an entry {A m )ij. The 
promise is that absolute value is of the entry (A m )ij at least eb m . Here m is polynomial 
and e is inverse polynomial in the logarithm of the size of A and b is an a priori given 
upper bound on the operator norm of A. Moreover, we showed that the estimation 
of entries remains PromiseBQP-hard when restricted to matrices with —1, 0, and 1 as 
entries. 

There are two main reasons why we want to improve upon our previous results. 
First, we seek to give the problem a more combinatorial flavor by considering only (0, 1)- 
matrices, i.e., adjacency matrices of graphs. In [2j we required negative entries in order 
to be able to simulate interference which is likely to be essential for quantum computing. 
As it will become clear later, we can avoid all negative entries and "simulate" them by 
comparing entries of powers of (0, l)-matrices. Second, we want to describe a setting 
where the sparseness condition occurs in a natural way. Recall that we refer to a 
definition of sparseness which implies that there is an efficiently computable function 
specifying the positions and the values of the non-zero entries. In the present paper 
such an efficiently computable function is directly given by rules determining the edges 
of the graph. The vertices of the graph are strings and two vertices are defined to be 

1 Strictly speaking, these problems were not denned as promise problems. However, it is not to difficult 
to show that it is possible to reformulate them as promise problems and to show that they are PromiseBQP- 
complete. 
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adjacent if they can be obtained from each other by a substitution of substrings; which 
substrings can be replaced is specified by a given relation. 

To place our result into the context of known results on the relation between BQP 
and classical complexity classes we should mention that BQP is contained in the clas- 
sical counting complexity class AWPP [9]. Here we relax the problem of counting the 
number of certain walks in a graph to an approximate estimation of the difference be- 
tween two combinatorial numbers of this kind. The crucial point was to specify the 
demanded accuracy so that the corresponding problem is not only PromiseBQP-hard 
but also in PromiseBQP. 

The paper is organized as follows. In section [2] we define the problem formally 
and in section [3] we show that it can be solved efficiently on a quantum computer. 
This section resembles the corresponding section in [2]. Nevertheless, the fact that our 
string rewriting problem is in PromiseBQP does not strictly follow from any of our 
earlier results since our definition involves a promise that generalizes the upper bound 
b on the norm of the matrix A in [2]. For this reason, String Rewriting is not really 
an instance of our problem of estimating entries of sparse matrices. It appears as an 
instance of the latter only if one neglects the promises. For this reason, we had to 
rework the entire argument. 

The main part of the paper is Section [4] where we show that String Rewriting is 
PromiseBQP-hard. The idea is strongly motivated by physical intuition. We first en- 
code a quantum circuit solving a PromiseBQP-complete problem into a one-dimensional 
translational invariant interaction with finite interaction range. This interaction "Hamil- 
tonian" H describes a physical system whose autonomous time evolution runs the quan- 
tum computation. We construct the Hamiltonian in such a way that an entry of H m 
for appropriate m determines the solution of the computation. Various Hamiltonians 
of this kind have been described in the literature in a different context. The special 
feature of our construction of the Hamiltonian H distinguishing it from the known ones 
is that it acts (up to a scaling factor) like a matrix with entries 0, 1, —1. This makes 
it possible to define a (0, 1) matrix A in such a way that we can express the entries 
of H m by differences of entries of A m . The fact that H is translation invariant and 
has only finite interaction range is important to represent the constructed adjacency 
matrix by a string rewriting problem. 

2 Definition of the string rewriting problem 

Simply speaking, we consider the problem to estimate the difference between two com- 
binatorial quantities: Consider three strings s, t, and t! of equal length and a set of 
allowed modifications. These modifications correspond to replacements of substrings 
according to a given relation on the set of substrings. The problem is to decide whether 
there are more possibilities to obtain t from s or to obtain t' from s using exactly m 
substitutions. Let A s t f/(m) denote the difference between these numbers of possibil- 
ities. The problem is to determine its sign. However, we have to carefully describe 
the given promise. On the one hand, we have a lower bound on the absolute value of 
^s,t,t'ijn) f° r the specific m. This makes it possible to determine the sign of A Si t i t/(m) 
even though only an approximation of the actual value is known. On the other hand, 
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we have an upper bound on the growth of A s f t /(n) for all n. Later, it will become 
evident that our estimation procedure for A<, ^/(m) is quite sensitive to this bound. 
We define our PromiseBQP-complete decision problem as follows. 

Definition 1 (String rewriting) 

We are given three strings s, t and t' of length L over some finite alphabet A and 
a positive integer m = poly(L) as input. Moreover, we are given a relation ~ on 
substrings of length at most k where k is some constant. Replacing substrings u, v 
with u ~ v naturally gives rise to possible conversions of strings in A L . Let A be the 
adjacency matrix of the graph whose vertices are strings in A L and whose edges are 
given by the possible conversions. For n £ N let 



be the difference between the number of possibilities to obtain t from s and those to 
obtain t' from s after exactly n replacements. 
We are given the promises 



and | A Sj f it '(m)| > ec m where c > is a constant and e £ [0, 1] with e = l/poly(L). The 
problem is to decide if either A Sj t ; f/ (m) > or A Sj ^t'(m) < 0. 

It should be mentioned that the relation on the substrings is not necessarily an 
equivalence relation. Our construction of the PromiseBQP-hard instance in section [4] 
will, for instance, be based on a non-transitive relation. 

Our formulation of the problem deserves some explanations. In a regular graph 
such a difference between the number of walks can be directly interpreted as prob- 
ability differences in a discrete time random walk: If d is the degree, (A m ) s j/d m is 
the probability to arrive at t given that s was the position m steps earlier. Having 
this interpretation in mind, it is certainly natural to ask whether random replacements 
are more likely to lead to string t' or to string t after m steps. In order to obtain a 
decision problem, we need a promise on the absolute value of the difference so that 
even an inaccurate estimation can distinguish between both cases. This is just the 
usual way how promises are used to cast estimation problems as decision problems [1] . 
To understand the role of the growth condition in ineq. (pQ) is more difficult. Roughly 
speaking, it implies some statements on the spectrum of A making it possible to adjust 
the algorithm to the smallest scale on which A s tj f/ (m) can be estimated. 



3 The string rewriting problem is in Promise- 



The main contribution of this paper is to show that our string rewriting problem is 
PromiseBQP-hard. The proof that it is in PromiseBQP is based on quantum phase 

2 To show that string rewriting is PromiseBQP-hard it suffices to consider a relation with the following 
property. If u ~ v then |it| = \v\, i.e., both substrings have the same length. It can be shown that if the 
relations identify substring of different lengths, then the corresponding string rewriting problem is still in 
PromiseBQP. 
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estimation, following closely the proofs in [21 [10] . However, we emphasize that it 
cannot be directly reduced to the problems in these references since the promises of 
the problems are not compatible. Nevertheless, the idea of the the proof is quite similar 
but the quantum algorithm presented here is more general. 

Let us briefly rephrase the definition of PromiseBQP. The characteristic feature of 
promise problems is that a set II C {0, 1}* specifies the allowed input strings. 

Definition 2 (The complexity class PromiseBQP) 

PromiseBQP is the class of promise problems (LTyeSjIIno) that can be solved by a 
uniform^ family of quantum circuits. More precisely, it is required that there is a 
uniform family of quantum circuits acting on poly(r) qubits that decide if a string 
x of length r is a YES-instance or NO-instance in the following sense. The application 
of y( r ) to the computational basis state |x, 0) produces the state 

rW|x,0) = a x , |0) ® |Vx,o) + a x ,i|l) ® ^x,i) (2) 

such that 

1. for every x £ II yes it holds that |a x ,i| 2 > 2/3 and 

2. for every x G IIno it holds that |a x ,i| 2 < 1/3. 

Any output is acceptable if x n = IIyes U IIno- Equation has to be read as 
follows. The input string x determines the first r bits. Furthermore, k additional 
ancilla bits are initialized to 0. After Y^ has been applied we interpret the first qubit 
as the relevant output and the remaining r + k — 1 output values are irrelevant. The 
size of the ancilla register is polynomial in r. 

BQP is the class of problems in PromiseBQP for which the promise is trivial, i.e., 
IT is the set of all strings. 

To show that our string rewriting problem is in PromiseBQP we first introduce 
some notation and terminology. Let 

A = E X J®i 
j 

be the spectral decomposition of the adjacency matrix A and \(p) be an arbitrary 
quantum state. The spectral measure induced by A and | if) is a probability distribution 
on the spectrum of A such that the eigenvalue Xj occurs with probability {ip\Qj\<p}. 
We refer to any quantum process allowing us to sample from the spectral measure 
as measuring the observable A in the state \ip). This corresponds to von Neumann 
measurements of quantum observables. To solve our problem on a quantum computer 
we will identify strings s with basis vectors \s) of a quantum register. This can be done 
in a straightforward way by representing each symbol by a basis state on |7o<72(|«4|)l 
qubits. 

The first basic observation making it possible to solve the string rewriting problem 
is that for any self-adjoint matrix A measuring A m can be reduced to measuring A 

3 By "uniform" we mean that there exists a polynomial time classical algorithm that generates a sequence 
of a polynomial number of quantum gates for every desired input length. 
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since we can compute A m from the outcome A. Due to standard spectral theory we 
have 

&\A m \<p)=J2X?(<p\Q j \<p), 
j 

i.e., the left hand side is the mth statistical moment of the spectral measure induced by 
A. We can estimate this value by repeatedly applying A-measurements to the state \(p) 
and computing the average of the mth power of the obtained outcomes. The second 
observation is that A. a t t'(m) is equal to 

A^M = AZ- A™, = V2(s\A m \<p) = V2^+\A m \if,+) - ^-\A m \r)) , (3) 
where we have defined 

\<p) := ±={\t) - 

and 

\^) :=±(\a)±\<p)). 

The second equality in eq. ([3|) is obvious, the third one is the polarization identity for 
real quadratic forms. Eq. ([3]) shows that we can write the difference to be estimated 
as a difference between the mth statistical moments of the probability distributions 
induced by measurements applied to the states |?/' ± ). 

The algorithm consists of the following three steps. Later, we will modify the third 
step to make the scheme less sensitive to errors. 

1. Given an upper bound b on \\A\\, construct a quantum circuit implementing the 
unitary operator U := exp(— iA/b) approximately. This is known to be possible 
for all sparse matrices A [HI [12j [13] . 

2. To realize phase estimation for U proceed as follows. Replace the gates of U 
with corresponding controlled gates to obtain the controlled-?/ gate. Repeat this 
transformation j times to obtain the controlled- U J unitary matrices. The outputs 
of phase estimation are the eigenvalues of U uniquely identifying those of A. Hence 
we have an approximate implementation of the von-Neumann measurement of the 
observable A. 

3. Apply A- measurements to the states \^)- By raising each outcome to the mth 
power, we estimate the corresponding expected values of A m in these states by 
computing the average over repeated sampling. To see that an efficient prepa- 
ration procedure for \ip ) exist we observe that we could, for instance, generate 
superpositions between the first three basis states and apply any unitary opera- 
tion that maps these states to |s), \t), and \t'}. 

There are two delicate points deserving our attention. First, we can only implement 
an approximate measurement of the observable A. By taking the m-th power of the 
outcomes the inaccuracies could be amplified up to the scale given by the mth power 
of the largest measurement outcomes that are possible. Second, the statistical error 
caused by the fact that we estimate expected values after a polynomial number of 
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samples can only be bounded from above if we have prior knowledge on the probability 
of large eigenvalues. For these reasons we modify the above scheme in order to make it 
less sensitive to the occurrence of large measurement outcomes and to small fluctuations 
of these values. We rewrite eq. (J3]) as 



A 



s,t,t' 



m 



(4) 



where denote the probabilities of the eigenvalue Xj in the spectral measure induced 
by an d l^ - )) respectively. Note that the growth condition in eq. ([1]) ensures that 

— pj = for all j with Xj > c. If we could sample from the spectral measure 
exactly, we could just drop all outcomes A with A > c. But we can only achieve that 
the outcomes are close to the true eigenvalues with high probability. 

Let us define a function / by 



-c m ,c m ] 



f(x) := 



for 
for 
for 



x 
x 



< 
> 
> 



We can replace XJ 1 in eq. (jlj) with /(A*). Hence we have rewritten A Sjt)t /(m) as a 
linear combination of expectations of f(A) instead of A m . Estimating the expected 
value of f(A) by measuring A is less sensitive to errors than estimating A m . This 
is because / is Lipschitz continuous with constant mc m . The following argument 
shows that this implies sufficient accuracy for estimating the expected values of f(A). 
Assume for the moment that we apply the A-measurement to an eigenvector of A with 
eigenvalue Xj . We showed in (2[ [10] that there is an efficient method that guarantees 
the following precision. With probability 1 — 9 the outcome A is r/-close to Xj, where rj 
and 9 are inverse polynomial in the description length of the matrix A. Consequently, 
the probability that X m deviates from XJ 1 by more than r/mc m_1 is at most 9. This 
provides the following upper bound on the discrepancy between the expected value in 
our imperfect measurement and the expected value of a perfect measurement applied 
to the eigenvector: 

J? mc m - 1 + 2c m fl. (5) 

By linearity (see the analogue arguments in [2j QH]), the same estimation also holds 
for general states. By choosing 9 and 77 appropriately the error is at most ec m /2 for 
every inverse polynomial e. Using Hoeffding's inequality in straightforward analogy to 
the estimations of subsection 5.3 in [2] one sees that we can also make the error caused 
by finite sampling smaller than ec m /2 using a polynomial number of runs. Hence we 
obtain an estimation of A s t ^/(m) up to an error of ec m with any desired probability 
that is inverse polynomially close to 1. 

It is straightforward but technical to construct a quantum circuit including the com- 
putation of the averages after repeated sampling such that it solves String Rewriting 
in the sense of Definition [2j 
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4 String rewriting is PromiseBQP-hard 



To prove that our problem is PromiseBQP-hard we have to show how to encode a 
quantum circuit into a graph that is derived from string rewriting rules in the sense of 
Definition [H The proof is divided into four parts. First, we encode it into a transla- 
tionally invariant 3-local Hamiltonian H acting on a qudit chain. Second, we establish 
that a certain entry of H m (an appropriate power of this Hamiltonian) indicates the 
solution of the quantum circuit. Third, we convert this Hamiltonian H into an oper- 
ator H that acts like an adjacency matrix A on the relevant subspace such that the 
difference of certain two entries of A m indicates the solution. Fourth, we show how to 
choose the exponent m. 



where a z is the Pauli matrix acting on the output qubit. Elementary calculation shows 
[2] that 



where pj := |a x ,j| 2 is the probability of finding the output qubit in the state \j) for 
j = 0, 1. We now construct a Hamiltonian H such that (a\H m \uj) is equal to po — p\ 
in eq. ([6]) times some constant. Here a and oo define strings that represent the initial 
and the final states of our computation, respectively. Our goal to express certain 
entries of the powers of the Hamiltonian H as entries of the powers of an adjacency 
matrix imposes very special requirements on its form. This prevents us from using the 
construction in [TJ] since it is essential for our purposes to work only with Toffoli and 
Hadamard gates. Note that similar constructions have been described in the literature 
in a different context. The constructions in [15 (. 1161 [T7] showed that Hamiltonian time 
evolutions can in principle perform computations (which had also been an issue of 
thermodynamics of computation at that time) and those in |18t [T9] that even relatively 
simple Hamiltonians can perform universal quantum computing. Moreover, there are 
construction of Hamiltonians whose low-energy states encode QMA-complete problems 



Neither of the above constructions satisfies all the conditions required for our proof: 
we need a 1-dimensional translationally invariant Hamiltonian acting on a qudit chain 
with finite range interactions whose autonomous time evolution makes it possible to 
realize universal quantum computing and is spectrally isomorphic to a quantum walk on 
a line. The Hamiltonian H that we now construct will, at first, follow our construction 
in [21] (which was used to prove that a certain quantum measurement problem is 
PromiseBQP-hard) but will be modified later. It acts on a qudit chain of length L, 
i.e., on the Hilbert space 7i given by 



4.1 The Hamiltonian H 





(6) 



[Sol EH E21 [23] . 



n -.= nf L 
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where H c is the Hilbert space of a single cell. Each cell consists of a data and a program 
cell. Their Hilbert spaces are given by Hd = C 4 and 7i p = C 14 , respectively. The basis 
vectors of 7id are identified with the symbols 

{0,1}U{||,.} (7) 

and those of TC P with 

{I, H, S, T, B} U {I, H, S, T, B} U {□, ■ 0, #} . (8) 

We construct an operator V acting on Tif" 3 and use it to define our translationally 
invariant 3-local Hamiltonian H. The form of H is 

H = F + Fi = J2(V j + v}), 
j 

where Vj denotes the embedding of V such that Vj acts on cells j — 1, j and j + 1 of 
Hf . Accordingly, we address the three cells on which V acts by —1, 0, and 1. 

In analogy to the constructions mentioned above, the idea is that the forward time 
operator F simulates the computation in the sense that F m \a) represents the mth 
step of the computation. The program band contains the program code specifying the 
sequence of gates to be applied to the data register, a certain region of the data band. 
A block of n program cells corresponds to one time step (layer) of a parallelized circuit. 
Such a time step can consist of an application of several gates acting on disjoint qubit 
sets. To simulate one time step of the circuit, the code has to be first translated to 
the left by n cells so that the commands of that time step are exactly above the data 
register and then the commands have to be executed. All these operations have to be 
performed by the local action of the copies of V. 

V is a concatenation of two operators 

V = CX. 

The execution operator X implements logical transformations on the data band that 
are controlled by the state of the program band. These logical transformations include 
the implementation of proper quantum gates. The transition operator C implements 
transitions between configurations of the program band. These transitions realize the 
propagation of the program. Before we formally define the operators X and C we first 
describe the intuition behind how they are used to implement the computation. 

1. Without loss of generality we assume that U consists of Toffoli gates acting on 
triples of adjacent qubits, swap gates acting on pairs of adjacent qubits, and 
Hadamard gates. Moreover, we assume that the first two qubits are the control 
qubits of the Toffoli gate and the third qubit is its target qubit. This is because 
the Toffoli and the Hadamard gates form a universal set of gates provided that 
the Toffoli gate can act on triples of arbitrary qubits [26J and non- adjacent qubits 
can be brought together by applying a sequence of swap gates. 

Assume we want to use F to to simulate the computational process defined by 
the following quantum circuit U 
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|6i) — r—{K- 
N -r-T- 

|6 4 > — [|l 

and also require that the simulation immediately stops after the swap gate has 
been applied. In this case, we would initialize the program and the data bands 
as follows: 

(first step) (second step) 

######BH I THIHIS B I # (program) 
• ji • • • • I b± 62 63 64 || • • • • |! • (data) 

The program code consists of a sequence of several gate symbols I, S, T, H, one 
execution symbol ■. and two boundary symbols B. The gate symbols trigger the 
implementation of the corresponding gates. The blank symbol # fills the empty 
program cells. The initial logical state |x, 0) is written into the data cells marked 
by bi, ... , 64 using the symbols 0, 1. The symbols • and || to the left and to the 
right of the data register are used to control the propagation and execution of the 
program code. The formatting symbol || divides the data band into blocks. This 
makes it possible to ensure that the program is only executed after the program 
has been propagated by another n sites (one block) such that the next time step 
(i.e., a block of n commands) of the simulated circuit is implemented. The symbol 
• marks the remaining unused data cells. 

Initially, the first block of the program code is aligned with the data block in the 
sense that the gate symbols corresponding to the first time step of U are located 
above the data cells they act on. 

2. The execution symbol ■ is propagated cell by cell to the end of the program code. 
In each step, the symbol ■ and the gate symbol G to the right of it are swapped 
and the gate corresponding to G is applied to the relevant data cells. Note that 
in the initial configuration the execution symbol ■ is at the right of the left end 
boundary symbol B. This is because the only purpose of the left copy of B is to 
ensure that a "computation in backward direction" (i.e., the application of Ft) 
annihilates the initial state. 

3. Once the execution symbol ■ has passed the end of the program code it is con- 
verted to the blank symbol # via the intermediate symbol and a signal is sent 
left toward the begin of the program code. This signal indicates that the execu- 
tion of the first time step of the program has been completed. It propagates by 
converting each gate symbol G to G cell by cell. Once the marked gate symbol G 
is at the begin of the program code it triggers the conversion of # into the hole 
symbol □ via the intermediate symbol <)• 

4. The hole symbol □ propagates cell by cell to the end of program where it is 
converted to # and triggers the creation of the left propagating marker G via the 
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intermediate turn-around symbol <>• The arrival of this marked gate symbol at 
the begin of the program code triggers the conversion of the next copy of # into □. 
This procedure is repeated until the begin of the program code is aligned with the 
next copy of the format symbol || on the data band. In this situation, the marked 
gate symbol G triggers the creation of ■ instead of □ and the whole cycle starting 
in step 2 is repeated until the execution symbol ■ and the boundary symbol B 
meet above the data register, which leads to the termination of the program. 

To define the transition operator C, we need to introduce the following transition 
rules: 



2 a) 

3 
4 



6 a) 



□ G 




G □ 








□ # 

* 




# 
* 








G 




G # 








F G 




F G 








# G 




G 








# 
* 




# □ 

* 



b) 
b) 



G 



# 



G 


■ 







# 



b) 



# 



We use * to denote any of the symbols •, 0, and 1. The symbol represented by * is 
left unchanged by the corresponding transition rule. In the transition rules 2 and 6 the 
lower right and left corners are left empty to indicate that the symbol at that place is 
not important for the transition rule and that it is left unchanged. 

Transitions 1 a) and b) implement the rightward propagation of the symbols □ and 
■, respectively. Transitions 2 a) and b) take place when □ and ■ have passed the end 
of the program code, respectively. They create the turn-around symbol 0- Transition 

3 creates a marked gate symbol that initiates the leftward moving signal. Transition 

4 implements the propagation of this signal. Once this signal has arrived at the begin 
of the program code, transition 5 and transitions 6 a) and b) create the symbols □ 
and ■ via the creation of the intermediate turn-around symbol 0, respectively. The 
execution symbol ■ is created only if the turn-around symbol is exactly above the 
formatting symbol ||, which happens only if the blocks of the program code are aligned 
with the data register. Otherwise, the hole symbol □ is created. 

The transition operator C acts on Tlf 3 , implemens the above transitions between 
configurations, and annihilates all the configurations that do not appear to the left of 
the arrow symbol in any of the above rules. C is embedded such that the left side of 
each box refers to cell and the right side to cell 1. 

Before we define the execution operator X, we specify the conditions for imple- 
menting the Toffoli, Hadamard, swap gates, and "boundary" gates: 

- program cell contains the execution signal ■ 
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- program cell 1 contains the corresponding gate symbol G 

- neither • nor || is contained in data cells —1, 0, and 1 in the case of the Toffoli 
gate, in data cells and 1 in the case of the swap gate, and in data cell 1 in the 
case of the Hadamard and boundary gates. 

Define Pq to be the projectors onto the subspaces of TLf s spanned by the configu- 
rations for which the above three conditions are fulfilled for G £ {S,T,H,B} and 

Pt := 1 - P S - Pt - Pr - Pb ■ 

Let Uq be the unitary operators that implement the gate corresponding to G on the 
relevant data cells for G G {H,S,T}. More precisely, they implement the gates on the 
subspaces spanned by the logical symbols and 1 in the relevant data cells and act 
trivially on the orthogonal complement of this subspace and on the state space of the 
cells of the program band. 

The execution operator X is now defined by 

X := U r P T + U S P S + U u Ph + Pi . (9) 

Note that expression © does not contain an operation that is executed in the 
subspace corresponding to Pb- The operator X is constructed such that it leads to 
annihilation when the execution ■ and the boundary B symbols meet above the data 
register for the first time. 

4.2 Encoding of solutions into entries of powers of the 
Hamiltonian 

To show that certain entries of H m encode the solution of the PromiseBQP-complete 
problem for appropriately chosen m, we denote the initial state of the program and 
data bands by |a) and consider the vectors F l \a) that represent the different steps of 
the computation for i = 0, ... ,£ — 1. Here ^ — 1 is the number of steps it takes till 
the execution symbol ■ and the boundary gate symbol B meet above the data register 
for the first time, i.e., the very next step would annihilate the state. Observe that the 
vectors F l \a) are mutually orthogonal and have length 1 for i = 1, ...,£ — 1. This 
follows from the fact that only one transition rule can be applied to the configurations 
of the program band in the vectors F l \a) for i = 1, ...,£ — 2 and the configurations 
are all different for i = 1, ...,£— 1. 
Define 

|u>) := |x,0>® |x>, 

where the left component |x, 0} is the initial state of the data register and the right 
component |x) corresponds to the final basis state of the remaining part of the data 
band and the whole program band in the (£ — l)th step. Note that |cj) would be equal 
to -F^ _1 |a) if the operator V did not contain the execution operator X, i.e., if we only 
had V = C. We now obtain 

(a\F e ~ V) = (x, 0\U\x, 0} =p - Pl . 
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The following identities hold: (1) F^F i \a) = F i_1 |a) for i = 1, . . . ,£ - 1 and (2) 
Ft|a) = 0. To see that (1) holds, examine every transition rule in the backward 
direction and check that only one rule is applicable for each configuration in F l \a) for 
i = 1, ...,£ — 1. Statement (2) holds because a backward step would first swap the 
positions of ■ and B in the initial state |a) (performed by TT) and then annihilate the 
state (performed by X'). This is because the projectors Pt- Pr, Ps, and P\ annihilate 
a configuration containing the symbol pair BB if the boundary symbol B is above a cell 
of the data register. As already mentioned, we have F \a) = 0. These properties imply 
that the Hamiltonian H = F + F>, restricted to the span of F l \a) for i = 0, ...,£ — 1, 
is unitarily equivalent to 

e-i 

V:=Y J \i)^ + l \ + \j + l )^\^ 

i.e., the adjacency matrix of the line graph acting on C . The state \oj) consists of a 
component in the span of the vector i 7 ^ -1 ^) and a component perpendicular to the 
whole orbit F i \a). Its overlap with i^-^a) (corresponding to the last basis vector in 
C ) is given by po — p\. This finally proves that 

(a\H m \u) = (p -pi)(P m )o,i-i. 

We choose m to be odd if £ is even and vice versa so that ('P m )o^_i is strictly greater 
than for m > £ — 1. Therefore, the sign of the entry (a\H m \uj) determines the 
solution of the PromiseBQP-complete problem by indicating whether po —p\ is positive 
or negative. 

4.3 Derivation of the string rewriting rules 

The operator V = CX in eq. Q has only the values 1,0, ±l/\/2 as entries since X 
has these values as entries and T has only and 1 as entries. The former follows from 
the fact that the projectors in eq. Q act on mutually orthogonal subspaces and the 
unitary operators have 1,0, ±l/\/2 as entries. The latter is due to the fact that C 
describes transitions between basis states. Recall that the values ±l/\/2 stem from 
the Hadamard gate and the value 1 from transitions that permute basis states. We 
derive the string rewriting rules by first converting V to V and then V to V in a way 
that the corresponding Hamiltonians are still able to compute the solution. 

The operator V contains 0, ±l/\/2 as entries and V only 0,1. We interpret the 
basis states on which V operates as substrings; an entry 1 in V then indicates that the 
corresponding substrings can be exchanged. 

To obtain V, we substitute the value 1 in V by matrices having only =bl/v2 as 
entries. This can be done by introducing an auxiliary band 7if L where 7i a := C 2 
and performing a "dummy" Hadamard gate accompanying each step that does not 
involve the execution of the Hadamard gate on the computational subspace of one 
of the cells of the data register. However, this additional dummy Hadamard gate is 
less harmless than it may seem at first glance. Recall that in our model the output 
is computed from the overlap of the final state -F^ -1 |a) and some basis state. By 



13 



performing additional Hadamard gates on the auxiliary band the propagating signal 
would permanently trigger the creation of new superpositions |±) := (|0) ± |l))/\/2 in 
the auxiliary band. This would exponentially decrease the overlap to any basis state. 

Therefore, we must ensure that the additional dummy Hadamard gate is always 
applied to the same qubit. Since the signal propagates the qubit cannot be physically 
the same. But it suffices to move the information with the signal. To achieve this, we 
distinguish between right-moving (rule 1), left-moving (rules 3, 4 and 5), and stationary 
transitions (rules 2 and 6). Note that these terms refer to the direction in which 
the signal propagates and not the direction in which the program code moves. We 
denote the projections onto the subspace of H® 3 <8> H® 3 <8> 7i® 3 corresponding to these 
configurations by Q r , Qi, and Q s , respectively. Let S-ip and So t i be the swap gates 
between the auxiliary qubits in cells —1,0 and those in cells 0,1. Let denote the 
dummy Hadamard gate acting on the auxiliary cell 0. To simplify notation we assume 
that these swap operators, the dummy Hadamard gate and the operators C, Pq, Uq, 
and Pi are now all embedded into H® 3 H® 3 <8> H® 3 . 

The modified operator V is 

V := C (5 0) i Q r + Qs + S- lt0 Qi) {W R (U T P T + U S P S + Pi) + U u Pr) ■ (10) 

The operator (So,i Q r + Qs + S-i^Qi) swaps either the auxiliary qubits in cells — 1 
and or those in cells and 1 depending on the direction in which the signal moves. 
The operator 

X := W R (U T P T + U S P S + Pi) + U h Ph 

applies the correct gate to data cells and the dummy Hadamard gate Hh iff no 
Hadamard gate is applied to the cells of the data register. To formally verify that 
V contains only the entries 0, ±l/\/2 (as already explained in an intuitive way) we 
recall that 

C := C {Sq^i Q r + Q s + S-ifiQi) 

is a permutation matrix. Therefore, we only have to check the entries of the right hand 
term X. We add the trivial expression OPb to X. The projections Pt, Ps, Pl, Ph, 
and Pb correspond to a partition of the basis states of (H p <8> H-d <8> Ha)® 3 into equiv- 
alence classes. The corresponding partition of the rows of X decomposes X into the 
submatrices Wh^tPt, WnUsPs, W-rPi, UrPu plus some trivial rows (corresponding 
to Pb) that contain only zeros. The non-trivial submatrices, are in turn, submatrices 
of the unitary matrices WhUt, Hh^s^ Wh, and Ur. Since they all contain exactly one 
Hadamard gate and only permutations apart from this, the statement is clearly true. 
The corresponding Hamiltonian H is 

* ■= + v h • 

j 

where Vj denote the copies of V acting on cells j — 1, j, and j + It is important that 
the entries of H m still indicate the result of the computation. Set 

\a) := \a)®\0)® L and \G>) := \u) <g> \0}® L , 
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where the state \0)® L denotes the initial state of the auxiliary band. It is readily 
verified that 

(a\H m \u) = -j±={a\H™\u) = -^=( Po - Pl )(V m )o,i~i , 

where the parameter d is set to if the number of dummy Hadamard gates is even and 
to 1 otherwise. In other words, an odd number of dummy Hadamard gates decreases 
the overlap by the factor 1 / y/2. 

We now convert V to V . The first idea to do that is as follows. Observe that 
the Pauli matrix a x applied to |— ) acts like a multiplication with —1. To use this 
observation for simulating negative signs, we introduce an additional band Hf L where 
H s := C 2 . We refer to H s as "minus-sign-simulator" or "simulator" for short. 

We write the logical Hadamard gate C/h in eq. (jlOp in the form 

c% = if ®(5>iii>oi)®i? 3 > 

ij 

where i, j denote the basis states in Hf 3 and faj G {0, ±l/\/2}. The symbols l p and 
l a denote the identity on program and ancilla cells, respectively. 
We define the operators Mij acting on Hf 3 by 

M i:j = I®a x ®I if Hij = -1/V2 
Mij = J® J® J if mj = l/y/2 
Mij = if fHj = 

We now define the modified operator Un acting non- trivially on ftf 3 ® Hf 3 (but 
formally defined on Hf 3 Hf 3 ® Hf 3 ® Hf 3 ) by 

U n := If ®(^K)(i|®^-)®lf ■ 

ij 

In the same way, we modify the dummy Hadamard gate to Wh- 

If we defined a modified version V of V by replacing Wu and Un with Wh and 
C/h, respectively (and replace the remaining operators in eq. (jlOp with their canonical 
embeddings into the extended system) we would already obtain an operator with entries 
0, 1. It would act just like V provided that the simulator cell in the middle is in the 
state |— ). To simulate the Hamiltonian H by a modified version H in this manner 
we would then have to initialize every simulator cell in the state |— ). However, this 
would only allow us to reconstruct the original overlap {a\H m \ib) by extending \a) 
and \Cj) with the multiple superposition |— )® L . This overlap would correspond to a 
linear combination of an exponentially large number of entries of H m as opposed to 
our intention to reduce the problem to comparing only two entries. For this reason, 
we initially prepare only one simulator cell in the superposition |— ) and propagate the 
superposition to the "active" cell, i.e., the cell in which either of the Hadamard gates 
is currently executed. In analogy to the swapping on the auxiliary band we define 

V := C {So,iS' 0)1 Q r + Qs + S- lfi SL lfi Qi) {Wh(U t P t + U S P S + Pi) + UrPu) , 
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where the swaps S' act on the simulator band. The particular form of V ensures that it 
only contains 0, 1 as entries. To show this formally we use similar arguments as above 
and recall that the term 

C := C (S^iSg^ Q r + Qs + S-iflS'_ 10 Qi) 

is only a permutation matrix. For the remaining term 

X := W h (U t Pt + U S P S + Pi) + U H P H 

we again define a decomposition into submatrices WuUtPt, WuUsPs, Wh-Plj and 
{/h-Ph- They are, in turn, submatrices of Wh^Ti WuU$, Wu, and IT H . These matrices 
certainly have only the entries 0, 1 since they have been obtained from matrices with 
entries 0, ±l/\/2 by the rule described above. 

Note that V + V* is an adjacency matrix because there is no basis state \b) of 
H® 3 ®ftf 3 ®Hf ®Hf 3 such that V\b) ^ and V^\b) + 0. This can easily be checked 
by verifying that there is no configuration in cell —1, 0, and 1 of the program and 
data bands such that one transition rule applies in forward and another (or the same) 
transition rule applies in backward direction. 

The matrix V + V* defines a relation on the substrings of length three over the 
alphabet A of 224 symbols that correspond to the basis states of 7i p <g> TCd <g> TC a ® Ti. s 
as follows. Let a, r € .A 3 be two arbitrary substrings. Then a can be replaced by r 
and vice versa iff (a\V + V*\t) = 1. 

This collection of allowed replacements of substrings of length 3 gives rise to an 
adjaceny matrix A with strings in A L as vertices in the following way: A s t = 1 iff 
there is a triple of consecutive positions j — + 1 with j = 2, . . . ,L — 1 such that 
(1) s and t differ at most at these positions (2) the substring Sj-iSjSj + \ can be replaced 
by the substring tj-itjtj+% in the sense of the relation specified by V + V'. 

The modified Hamiltonian H is 

Note that this Hamiltonian can have entries greater than 1 in contrast to A. For 
example, this can occur if we can go from some string s to some other string t by 
applying two different substitution rules on two different triples. But it is clear that 
H Si t > iff A s t = 1. However, the adjacency matrix A and the Hamiltonian H are 
equal when restricted to the span of F l \a). This is because the transition rules and 
the initial state were defined such that there is always exactly one possible transition 
in forward time direction (see the properties of the states F l \a) that are discussed at 
the beginning of Subsection 14. 2b . 

Appropriate entries of A m encode the solution of the computation. To see this we 
define the states 



a ) 


:= d) 


(8) 


|0- 


•000- 


•0) 


Oil) 


:= d) 


8) 


|0- 


•010- 


•0) 




:= \G>) 
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|0- 


•000- 


•0) 


UJl) 


■■= \&) 




|0- 


■010- 


•0) 
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where the tensor components on the right hand side refer to the simulator band and 
the symbol 1 is above the execution symbol in \a) and \u>), respectively. 

Using the superpositions |a±) := (|cti) ± \ao))/-\/2 and \u±) := \uji) ± \u>q})/\/2 we 
can reproduce the relevant entry of H m : 

{a_\A m \uo_) = {a_\H m \uj_) = s[T {a\H m \Co) . (11) 

The factor y/^" occurs because the entries l/y/2 and —l/y/2 in H have been replaced 
by the matrices I and a x , respectively, that have eigenvalues of modulus 1. Note that 
we have (a_ \H m \uj + ) = because the computation never transforms the superposi- 
tion |— ) into |+) in the simulator band. Using |t^o) = + \ UJ -))/V 7 ^ we obtain 
(ct-\H m \oj-) = \/2(a^\H m \uj }. Putting everything together, we obtain the desired 
result 

(a \A m \u ) - <ai|^> > = (a \H m \u ) - ( ai \H m \uj ) (12) 

= V2 m (a\H m \u) (13) 

= Vr i ( Po - Pl )^=(V m )o,i-i- (14) 
Vd+l 

The basis states |ao), and \a±) define our strings of interest via s := u, t := a±, 
and t' := oq. Then we have 

A s>t , t ,(m) = (A m ) st -(A m ) st , 

= (a \A m \u ) - <ai|A m |wo) 

= vr(p - P1 )-^=(vno,i-i, (15) 

i.e., the sign of A Sjt)t /(m) depends on the solution computed by the original circuit. We 
have to show that the gap and growth conditions are satisfied to complete the proof 
that the so constructed instance of our string rewriting problem determines whether a 
given string x G II is either in Uyes or in IIno 



4.4 Appropriate choice of the power m 

To choose m appropriately and to check that both conditions are satisfied, we use the 
spectral decomposition of the adjacency matrix of the line graph 

l-\ 

V = J2 X j\ e j)( e j\ ■ 
j=o 

The eigenvalues are Xj = 2 cos ( ^jipp- ) for j = 0, . . . , The normalized eigenvecotrs 
are 

t-i 

l e i) : =^2 e 3,k\ k ) 

k=0 

with 

'7T0' + l)(fc+l) 



/ TTU + iKfc + m 

l + l sm { t + l ) 
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for j,k = 0,...,£-l [27]. 
Define the (signed) weights 

wj := (0\ej)(ej\£ - 1) = e Jj0 e^_i for j = 0, . . . ,£ - 1 . 

We have 



£-1 

m 

i=0 



The upper bound 

(O0/-1 < K (16) 
follows from |Aj| < Ao for j = 0, . . . , I — 1 and 

£M<i- 

3=0 

The latter bound follows by elementary linear-algebraic arguments from the fact that 
the eigenvectors \ej) are orthonormal. 
We obtain the growth condition 

1 



|A SiS , jt (n)|<^=(x/2-Ai) 



by applying the upper bound on (V m )o t e-i in eq. (fT6|) to eq. ([IB]) . 

To derive a lower bound on (7 ?m )o^-i, we consider the two eigenvalues of largest 
modulus, i.e., Ao = 2 cos(7r/£ + 1) and A^_i = — 2cos(7r/£ + 1). The corresponding 
weights and satisfy the relation 

since 

sin ( ^ = sin ( (I - 1)* + v^r) = - ' * 



Recall that m is even if £ is odd and vice versa so that Xowo = ^e-i w e-i > 0. Observe 
that |Aj| < A x for j = 1, . . . ,£ - 2. 
We now obtain the lower bound 



(V m )o,e-i = 2A^o + 5>"V 



1-2 

m 

'j ^3 
3=1 

1-2 

> 2X^w - X? \w 

3=1 

> 2X^w -X r [ l 

■\i\ m 



> X^w (17) 
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if m is chosen such that (Ai/Aq)" 1 < wq. We set £:=£ + ! and m := £ 3 and obtain 



/Ai\ m < / cob(27t/M 
vW ~ \ cos(tt/1) J 




( i - (^) 2 A 

^1 -|(7r/I)^ 



The term inside the square brackets converges to e 71-2 / 2 5 hence the entire term con- 
verges to zero. Inserting eq. (fTTl) into eq. (fT5l) we obtain 



observing |po — Pi| > 1/3 due to the promise that either po > 2/3 or po < 1/3- Using the 
Taylor expansion of the sine function one verifies easily that wq converges only inverse 
polynomially to zero. With e := wq the gap condition in the sense of Definition Q] is 
satisfied. 

5 Conclusions 

We have described a purely classical combinatorial problem that characterizes the 
complexity class PromiseBQP, i.e., the class of problems that can be solved efficiently 
on a quantum computer. Given that BQP^ BPP, our result shows that the quantum 
computer outperforms the classical computer in estimating differences of combinatorial 
quantities. 
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